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C ABSTRACT. Motivated by the Cauchy-Davenport theorem for sumsets, and its inter- 

pretation in terms of Cayley graphs, we prove the following main result : There is a 
universal constant e > such that, if Sf is a connected, regular graph on n vertices, 
then either every pair of vertices can be connected by a path of length at most three, or 
, the number of pairs of such vertices is at least 1 + e times the number of edges in <S . 

We discuss a range of further questions to which this result gives rise. 

in 
(N 

O ' 1- Introduction and Statement of Results 

. Let A be a subset of an abelian group G, written additively, and h a positive integer. 

^ \ The /i-fold sumset hA is defined as 

hA = {g G G : g = a\ + • • • + a h for some a 1; G A}. (1.1) 

We say that A is a basis for G if hA = G for some h. The cardinality of a set S will be 
denoted \S\. The following is a (special case of a) fundamental result in the theory of 
^ . sumsets : 

Theorem 1.1. (Cauchy-Davenport J Let p be a prime and A a subset ofL p . Then 

\hA\ >min{p,h\A\- (h-1)}. (1.2) 

There is a well-known generalisation of this result to arbitrary abelian groups, due 
to Kneser flKj, but that is not what is of primary interest to us here. Instead, we are 
interested in interpreting the Cauchy-Davenport result in terms of graphs. First, recall 
the following definition : 



Definition 1.2. Let G be an abelian group and S a subset of G. The Cayley graphQ 
£f = &(G, S) is the directed graph whose vertices are the elements of G and whose 
edges consist of the ordered pairs (gi, g 2 ) such that g 2 — g\ G S. 

Note that the graph @(G, S) is strongly connected if and only if S is a basis for G. 
For simplicity, let us assume this, plus that the set S is symmetric, i.e.: S = —S, and 
contains the identity element of G. Then we can think of the Cayley graph as being 
undirected, with a loop at every vertex. In this case, let %(G, S) be the part of &(G, S) 
with all the loops removed. For the rest of the paper, we shall only deal with undirected, 
loopless graphs. Now consider the following definition : 
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Definition 1.3. Let £f be a graph on vertex set V and h a positive integer. We denote 
by hfS the graph on vertex set V such that {v\, v%} is an edge in hfS if and only there is 
a path joining v\ to v 2 in & of length at most h. We shall call h& the ^i-fold sumgrapb@ 
ofW and denote its set of edges by h<§ = S{h^). 

Then the following is an immediate consequence of the Cauchy-Davenport 
theorem : 

Theorem 1.4. Let p be a prime, A a subset ofZ p such that G A and A = —A. Let 

& = &o(Z p ,A) be the Cayley graph of A, minus all loops. Then for every positive 
integer h, 

\h^\>minU P 2 Yh\^Y (1.3) 

The question which motivated this paper is whether anything like this result is true 
for more general connected graphs. More precisely, the feature of Theorem 1 .4 that we 
are interested in generalising is the fact that the (edge) sizes of the graphs grow at 
least linearly in h, as long as ^ isn't already too dense. As we shall show below, it is 
hopelessly optimistic to hope for anything like this phenomenon in arbitrary connected 
graphs. However, Cayley graphs have the very important property that they are regular. 
Our main result is the following partial generalisation of Theorem 1 .4 : 

Theorem 1.5. There is a universal constant e > such that if& is a regular, connected 
graph on n vertices, then 

|3<f|>minjY £ Y(l + e)|«?|}. (1.4) 
In fact, we can take e to be the unique positive root of the equation 

e=\(l-^)\ (1-5) 

i.e.: e « 0.087.... 

We were surprised by the simplicity and elegance of this result, which is why we con- 
sidered it worth mentioning. Of course, it is unsatisfactory in many respects so some 
detailed remarks are in order : 



1. The obvious problem with our result is that it cannot be used recursively to ob- 
tain estimates for the growth of /i-fold sumgraphs for arbitrary h. This is because, even 
if the graph £f is regular, then the graphs hfS need not be, for any h > 1 (note that regu- 
larity is preserved for Cayley sum-graphs). Thus it remains to obtain a generalisation of 
Theorem 1.5 to /i-fold sumgraphs for arbitrary h. Note that, for fixed degree, the sum- 
graphs hfS grow at least linearly 'on average' until the graph becomes complete. This is 

This definition is well-known in the literature, though it is standard to use multiplicative notation 
instead, which is natural when one thinks in terms of the adjacency matrix of the graph. So what we are 
calling the /i-fold sumgraph US is usually referred to as the h-th power ofS and denoted & h . Observe 
that if we add a loop at each vertex of and let A be the adjacency matrix of the resulting graph, then 
\£'{'£ h ) | is just half the number of non-zero off-diagonal entries in A h . 

For the remainder of this note we shall retain our additive notation and terminology for graphs so as 
to emphasise the connection to sumsets. 
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a trivial observation, but a more precise result is contained in the next proposition. Re- 
call that the diameter of a graph is the smallest 5 > such that any pair of vertices are 
connected by a path of length at most 5. In the notation of Definition 1.3, the diameter 
of a graph & onn vertices is the smallest h such that hfS = K n , the complete graph. 
Now we have 

Proposition 1.6. Let ^ be a connected graph on n vertices and of minimal degree d. 
Then 

diam (&) < 3n ~^ + 3) , (1.6) 

2. However, the growth of sumgraphs can certainly be irregular. In particular, and this 
is the most natural thing to ask about, there is no constant e' > such that the analogue 
of Theorem 1.5 holds for 2-fold sumgraphs. To see this consider the following 
example : 

Example 1.7. Fix d > and let n be a multiple of d + 1, say n = mid + 1). Let 

y = ^d,m be the following graph on n vertices : Partition the vertex set V into m 
disjoint subsets of size d + 1, say V±, V2, V m . For each i — 1, m pick two vertices 
vn,Vi2 € Vi. Now the graph % m contains the following edges : 

(i) for each i = 1, m, insert all edges among the vertices ofVi, except the edge 
{vn,v i2 }. 

(ii) for each i = 1, ...m — 1, insert the edge {vn, i^j+i)^}, and then finally add the 
edge {v ml ,v 12 }. 

Clearly, this graph is connected and d-regular, so 

Kml = ( fU- (1-7) 



However, one easily checks that 



so that \2Sd >m \ = (1 + o d {l))\S d ^ m \. Note also that for this graph one may check that 

fd 2 + 4\ 

|3^, m \^, m |= [-—- n, (1.9) 



d+1 



so that |3<f d>m | = (3 - o d (l))\S d j 



Considering this example naturally leads one to asking for more precise extremal 
results. We believe that the graphs ^ m are essentially extremal for 2-fold sumgraphs, 
but these latter objects are still somewhat mysterious to us. Motivated by (1.8), we are 
prepared at this stage to conjecture the following : 

Conjecture 1.8. Let d, n be positive integers. If& is a d-regular graph on n vertices, 
then either 2& = K n or 

|2<£V| > (2-o d (l))ra. (1.10) 
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Note that, in the notation of this conjecture, if n > d+2 then trivially > n/2, 

since every vertex is connected to at least one non-neighbor by a path of length two. 
Hence there is a factor of four separating (asymptotically) the trivial lower bound for 
|2<f and what we conjecture to be the truth. 

Neither is it clear to us whether the graphs %,m are essentially extremal for 3-fold 
sumgraphs. The question here is what is the best-possible choice of the constant e in 
Theorem 1.5 ? By (1.9), we can't take e > 2. Indeed, the same conclusion could be 
drawn by considering the Cayley graph of an arithmetic progression. 

Also, note that the graphs % m are certainly not close to being extremal sumgraphs 
in general. This is because it is easy to see that @d,m has diameter m + 1 = + 1, 
whereas from the proof of Proposition 1.6 we will easily be able to construct examples 
which show that the upper bound in (1.6) is essentially best-possible, even for regular 
graphs (see Remark 3.1). Hence, we suspect that the extremal problem for sumgraphs 
in general might be quite hard. 

3. Finally, note that there doesn't seem to be any hope of obtaining meaningful general- 
isations of our results to graphs which are not regular. For example, let n be a positive 
integer and let ^ n be the graph on n vertices which is the union of a complete subgraph 
on [n 3 / 4 J vertices and a path of length n — [n 3 ^ 4 \ which is joined to the complete sub- 
graph at one vertex. This graph is connected and contains 0(n 3 / 2 ) edges but, for any 
fixed h, the /i-fold sumgraph contains only 6 fc (n) additional edges. 

The rest of the paper is organised as follows. Sections 2 and 3 are devoted to the proofs 
and discussion of Theorem 1.5 and Proposition 1.6 respectively. Section 4 contains a 
quick recap of unresolved issues. 



Notation. If is a graph and X C V(<g), then N(X) will denote the set of all 
neighbors of the vertices in X. If X is a singleton set, say X = {x}, then we simply 
write N(x). 

Let d, n be positive integers and let £f be a connected, rf-regular graph on n vertices. 
Let e > be the solution of (1.5) and suppose that |3£f | < (1 + e)|Sf |. We must show 
that 3£f = K n . Since IS C 3<f, we can first of all deduce that < (1 + e)\S \. We 
present the argument in a sequence of steps. 

Step 1 : Set e 1 := yfl. For each v e V(^), let 



2. Proof of Theorem 1.5 



T ■ 



{w e V(&) : {v,w} e 2<g\<g} 



(2.1) 



and let 



{v E V{&) : \T V \ < eid.} 



(2.2) 



Since, by assumption, 




(2.3) 



-I; 



A CAUCHY-DAVENPORT TYPE RESULT FOR ARBITRARY REGULAR GRAPHS 



5 



it follows easily that 

|Vi| > (l-ei)n. (2.4) 

Step 2 : Let v G V t . Set A v : = N(v), B v := N(N(v)) and C v := U A,). If 

the set C v were empty then, since the graph is connected, it would imply that V(& ) = 
{v} U A„ and hence that 2£f = K n . So we may assume that C v is non-empty. If c E C v 
then there is a path i> — > a — > c in Sf , for some a G A, hence {t>, c} G 2£ '. By definition 
of the set V\, it follows that 

\C V \ < eid. (2.5) 

Set D v := \/(^)\({f } UA V \JC V ). Suppose this set were empty. Since <3 is connected, 
it would imply that V(^) = {v} U A v U C v . We claim that, in this case, W = K n . 
We need to show that any two vertices in {v} U A v U C v can be connected by a path 
of length at most 3. This is obvious unless both vertices lie in C v . Consider a pair of 
such vertices, say c\ and c 2 . Our assumptions say that N(ci) U N(c 2 ) Ci^U C v . But 
by (2.5), ci-regularity and the fact that e x < 1/2 (see the statement of Theorem 1.5), it 
follows that c\ and c 2 must have a common neighbor in A v . Hence {ci, c 2 } G 2c?, in 
fact. 

So we may now assume that the set D v is non-empty. There must be at least one 
edge between C v and D v . For any such edge, say {c„, d v }, we know by (2.5) that at 
least (1 — ei)d of the neighbors of c v lie in A V U D v . Let ^ be the set of vertices in C v 
with at least one neighbor in D v and set 

a v := — — ~T~ x max{|iV(c 1 ,) n A„| : c„ G K}- (2.6) 

In the steps to come, we consider the following two cases, at least one of which must 
obviously apply : 

Case 1 : For at least half of all v G V\, one has a v < \. 
Case 2 : For at least half of all v G V±, one has ct v > \. 

Step 3 : Suppose Case 1 holds. Let V 2 :— {v G Vi : a v < \}. For each v G V 2 , 
pick any vertex c v G Then there are at least |(1 — ei)d choices for an edge {c„, d v } 
such that d v G D v . Notice that, for any choice of d v , there is a path v — > a„ — > c„ — > d v 
in ^, for some a v G A„. Hence {f , d v } G 3cf\(f . Summing over all v G V 2 and noting 
that any given pair of vertices is counted at most twice, it follows that 

|3A^I > \ ■ \n ■ ^(1 - > ^(1 - ci) VI > (2-7) 
contradicting our assumptions. 

STEP 4 : Suppose Case 2 holds. Let V 3 := G V 1 : a„ > |}, so that |V 3 | > \\V\. Let 
v G V 3 and fix a choice of a vertex c v G ^ such that c v has at least \ (l — ti)d neighbors 
inside A v . Let d v be any neighbor of c v inside D^,. Observe that all the neighbors of 
d v lie inside C v U Dt,. Hence, by (2.5), there are at least (1 — ei)d choices for a vertex 
e v G N(d v ) fl D v . For any such vertex e v and any vertex a v G A^"(c„) fl A v , there is a 
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path in the graph a v — > c v — > — > e„. Hence {a„, e„} G ?>S\S. Therefore, if we set 

§ := J2 #^ a -' e ^ e :a v eA v ,e v e D v }, (2.8) 

then we have 

|s| > rax (i^ x(1 _ £l)(i >f(lziO!^. (2 . 9) 



On the other hand, since since v G N(a v ) always, any pair of vertices can appear in the 
sum at most 2d times. It follows that 

\M\£\>\(l-ei?\£\>e\£\, (2.10) 
which again contradicts our assumptions, and completes the proof of the theorem. 

3. Proof of Proposition 1.6 

Let <S be a connected graph on n vertices of minimal degree d. Let 5 be the diameter 
of <3 and let v, w be a pair of vertices such that a shortest path between them has length 
exactly 5. Let such a path be 

Vl = v -> v 2 -> > ^ -> = w;. (3.1) 

Let A be the set of vertices along the path and B := V(&)\A. Using the fact that there 
is no shorter path in & between v and w, we shall count in two ways the number e(A, B) 
of edges in between A and 5. On the one hand, this fact implies that there are no 
edges between the vertices along the path other than those in the path itself. Since 
has minimal degree d, it follows that 

e(A, B) > (5 - 1) • (d - 2) + 2 • (d - 1) = (5 + l)(d - 2) + 2. (3.2) 

On the other hand, the absence of a shorter path between v and w means that no vertex 
in B can be joined to more than three vertices of A (and if it joined to exactly three of 
them, then they must be adjacent along the path (3.1)). Hence, 

e(A,B) <3\B\=3(n-5-l). (3.3) 

From (3.2) and (3.3) one easily deduces (1.6). 

Remark 3.1. The proof just given can be easily adapted to construct explicit examples 
of graphs which show that the upper bound in (1.6) is essentially best-possible. Let 
d > 5 be odd for simplicity and choose a non-negative integer k. Let 

a := 3(k + 1), b := k(d - 2) + 2(d - 1) = (k + 2)(d - 2) + 2, n:=a + b. (3.4) 

We construct a rf-regular graph on n vertices as follows. The vertices of £f are parti- 
tioned into two disjoint sets A and B such that \A\ — a, \B\ — b. Denote 

A := {vi, v a }, B := {w!,...,w b }. (3.5) 

The graph will contain the following edges : 

Type 1 : The edges of the path v± — > v 2 v a . 

Type II : All edges {v h wj} such that 1 < % < 3 and 1 < j < d — 1, except the edges 

{v 2 ,Wd-i} and {w 3 ,^i}- 
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Type III : All edges {v ( a +i)-i, W(b+i)-j} such that 1 < i < 3 and 1 < j < d — 1, 
except the edges {v a _i, w 6+2 _ d } and {v„_ 2 , tu&}. 

Type IV : All edges {v Sr+s , W(d-2)r+i+t} such that l<r</c, l<s<3 and 
1 < t < d-2. 

Type V : The complete subgraph on the vertices wi, ...,Wd-i, minus a perfect matching 
on the d — 3 vertices u> 2 , u>d- 2 . 

Type VI : The complete subgraph on the vertices Wb+2-d, ■ ■ ■ , Wf,, minus a perfect match- 
ing on the d — 3 vertices Wb+3~d, Wb-i- 

Type VII : For each 1 < r < k, the complete subgraph on the vertices 

Wr(d-2)+2, W(r+l)(d-2)+l- 

One can readily check that this graph is indeed d-regular and, for k 3> 0, of diame- 
ter a — 1 := 5. Moreover, for d > 5 one has 

a =L 3 "-^ +3 > J. ,3.6) 



4. Concluding Remarks 

There are two obvious directions in which the results of this paper need to be im- 
proved upon. The first is to generalise them to /i-fold sumgraphs for arbitrary h, and 
in particular to understand better the most natural case when h = 2. The second is to 
sharpen them, in particular to obtain the best-possible constant e in Theorem 1.5. Both 
directions naturally lead in turn to Freiman-type inverse problems, where one wishes 
to say something about the 'structure' of regular, connected graphs whose sumgraphs 
grow slowly. 
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